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Abstract 

Glucosinolates are major secondary metabolites found in the Brassicaceae family. These compounds play an essential role in 
plant defense against biotic and abiotic stresses, but more interestingly they have beneficial effects on human health. We 
performed a genetic analysis in order to identify the genome regions regulating glucosinolates biosynthesis in a DM 
mapping population of Brassica oleracea. In order to obtain a general overview of regulation in the whole plant, analyses 
were performed in the three major organs where glucosinolates are synthesized (leaves, seeds and flower buds). Eighty two 
significant QTLs were detected, which explained a broad range of variability in terms of individual and total glucosinolate 
(GSL) content. A meta-analysis rendered eighteen consensus QTLs. Thirteen of them regulated more than one glucosinolate 
and its content. In spite of the considerable variability of glucosinolate content and profiles across the organ, some of these 
consensus QTLs were identified in more than one tissue. Consensus QTLs control the GSL content by interacting epistatically 
in complex networks. Based on in silico analysis within the B. oleracea genome along with synteny with Arabldopsis, we 
propose seven major candidate loci that regulate GSL biosynthesis in the Brassicaceae family. Three of these loci control the 
content of aliphatic GSL and four of them control the content of indolic glucosinolates. GSL-ALK plays a central role in 
determining aliphatic GSL variation directly and by interacting epistatically with other loci, thus suggesting its regulatory 
effect. 
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Introduction 

The Brassica genus includes six agricultural important species 
which are grown in many countries, and important oil, condiment 
and vegetable crops. Brassica vegetables like broccoli, cabbage, 
Chinese cabbage, turnip greens and leaf rape, among others, are 
consumed throughout the world. FAO Statistics (FAOStat 2011) 
show that the production of cauliflower, broccoli, kales and other 
crucifers was 8.2% of the total vegetable production of the world 
in 20 1 1 . The most consumed crop of this genus in Europe and the 
USA is Brassica oleracea. This species includes cabbages, kales, 
broccoli and cauliflower, among others. 

Glucosinolates (GSLs) are the major class of secondary 
metabolites found in the Brassicaceae falUy, including the Brassica 
genus. The hydrolytic breakdown products of GSLs (especially 
isothiocyanates) have beneficial effects on human health, such as 
cytotoxic and apoptotic effects in damaged cells, thus preventing 
cancer in humans and reducing the risk for degenerative diseases 
[1-3]. They also enhance plant protection to abiotic and biotic 
stresses [4]. GSLs could exhibit certain adverse effects. For 
example, progoitrin can cause goiter in animals [5], which 
provoked the deliberate reduction of GSL levels in B. napus in 
the past. However, there is no evidence of any goitrogenic effect 



coming from Brassica consumption in humans [6]. Currently, 
efforts are concentrated on increasing the level of health 
promoting GSLs in Brassica crops. For example Sarikamis et al. 
[7] selected broccoli for higher levels of 3-methylsulphinylpropyl 
(GIB) and 4-methylsulphLnylbutyl (GRA), which are the precursors 
of the isothiocyanates called iberin and suUbraphane, respectively. 
The beneficial effects of both isothiocyanates on human health are 
well known, having an influence on carcinogenesis during the 
initiation and promotion phases of cancer development [8]. 
Knowledge on the genetics underlying the synthesis and accumu- 
lation of GSLs in Brassica crops is an important tool for designing 
appropriate strategies in order to increase the content of those 
GSLs related to human health and plant protection. 

GSLs are divided into three different classes according to the 
amino acid precursor in biosynthesis: (1) aliphatic GSLs derived 
from alanine (Ala), leucine (Leu), isoleucine (lieu), valine (V al), and 
methionine (Met); (2) aromatic GSLs derived from phenylalanine 
(Phe) and tyrosine (Tyr) and (3) indolic GSLs derived from 
tryptophan (Tip) [9] . In Arahidopsis thaliana and Bras.nca crops, most 
GSLs are synthesized from Met. GSL biosynthesis is a tripartite 
pathway involving three independent steps (Fig. lA): (i) side chain 
elongation of some precursor amino acids such as Met and Phe, by 
adding one or several methylene groups. Chain elongation is 
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carried out by methylthioalkylmalate synthase enzymes (MAM), 
(ii) Development of the core structure, which includes several steps: 
aldoxime formation catalyzed by the CYP79 family of cyto- 
chromes P450; aldoxime oxidation by the CYP83 family; 
thiohydroximic acid formation by conjugation to an S donor 
and after C-S bond cleavage; desulfoGLS formation by S- 
glucosyltransferase (S-GT); and GSL formation by sulfotransfer- 
ase. (iii) Secondary modification of the amino acid side chain 
which includes oxidation, hydroxylation, methoxylation, desatu- 
ration, sulfation, and glycosylation [10,11]. 

To date, major genes and transcription factors involved in the 
three steps of GSL biosynthesis have been identified and 
characterized in the model plant, A. thaliana. Based on A. thaliana 
homology, three loci were identified in B. olemcea and cloned [12- 
1 4] : two loci responsible for the elongation of the side chain of 
aliphatic GSLs named BoGSL-ELONG and BoGSL-PRO 
(homologous to MAM-1 and MAM-2 genes, respectively of 
Ambidopsis) and one locus responsible for side the chain desatura- 
tion and production of an alkenyl GSL named BoGSL-ALK 
(homologous to AOP2 gene of Arabidopsis). Afterwards, these loci, 
plus genes BoCS-lyase, BoGS-OH and BoCYP79Fl, were 
mapped [15]. However, genes responsible for other steps of the 

A B 



metabolic pathway remain undiscovered. Identification of meta- 
bolic QTLs (QTLs) is essential for the understanding of the 
quantitative genetic control of secondary metabolites and it is an 
early step to identify the genes underlying trait variation. The high 
co-linearity between A. thaliana and Brassica species can be used in 
order to identify candidate genes underlying QTLs that affect GSL 
content. In addition to identilying structural and accumulation 
QTLs, it is important to determine the extent of epistatic 
interactions between loci which may play an important role in 
determining variability for GSL content. 

The accumulation and profile of GSLs in plants are highly 
dependent on the genotype, although it is also affected by 
environmental and developmental factors. In Ambidopsis, GSL 
profiles have been systematically monitored during plant devel- 
opment and vary significantly among tissues and organs [16-19]. 
In B. oleracea, developmental stages and the type of tissues may 
modify tiie type of GSLs and its levels [20,21]. Currendy, little is 
known about the genetics of GSL content within the plant 
ontogeny. For this reason, it is necessary to develop a better 
understanding of the genetics underlying GSL biosynthesis and 
accumulation in different tissues in B. oleracea. 
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Figure 1. Formation of the core structure of the three major groups of glucosinolates in A.thaliana, including the genes controlling 
this process [11,43]. (A). A biochemical genetic model of the biosynthesis of aliphatic glucosinolates in Brassicaceae including the major genes 
controlling this process [57] (B). 
doi:1 0.1 371/journal.pone.0091 428.g001 
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In the present study we identify QTLs for GSL composition and 
accumulation in B. okracea leaves, flower buds and seeds in a 
double haploid (DH) population. We also perform a comparative 
genomic analysis based on A. thaliana-B. okracea synteny in order to 
find candidate genes underlying QTL variation. Epistatic 
relationships among QTLs are also described. This information 
may increase the understanding on the quantitative genetic control 
of these traits and it is useful in order to identify genes controlling 
GSLs in B. okracea. 

Materials and Methods 

Plant material and growing environments 

A double haploid (DH) mapping population (BolTBDH) was 
employed in this work. The population was created from an Fi 
individual, from a cross between a DH rapid cycling of Chinese 
kale (TO1000DH3, Pj) and a DH broccoli line 'Early Big' (Pj) 
[22]. TO1000DH3 is the reference genome for the B. okracea 
sequencing project. Firsdy, parents and 155 DH lines were grown 
and selfed in the greenhouse in 2010 under: 16 h of daylight and a 
temperature of 24±2°C; 8 h of darkness having 18±2°C at night; 
and a relative humidity of 55% in order to obtain enough seed in 
the same environmental conditions. Selfmg was carric'd out by 
bagging each individual plant inside a microperforated polyeth- 
ylene bags. Five bulks of 10 mg of seed for each line were prepared 
for GSL analysis with the seeds obtained. In 2011 (from 
September to November), seeds from parents and 155 DH lines 
were sown with the same photoperiod and temperature as in 2010. 
Plants were sown in a completely randomized experiment with 
two replications and 4 plants per replication and DH line. 

From each line, leave samples were taken at the 4 leaf stage and 
flower buds were taken difTerentially depending on the flowering 
time of each plant. One bulk was taken from each replication by 
mixing the four samples of leaves and flower buds. Samples were 
immediately frozen in liquid N2, transferred to the laboratory' and 
conserved at -80°C until processing. All samples were lyophilized 
(BETA 2-8 LD plus, Christ) during 72 h. The dried material was 
powdered by using an IKA-AIO (IKA-Werke GmbH & Co.KG) 
miU, and the fine powder was used for GSL extraction. 

GSL identification and quantification 

Sample extraction and desulfation were performed according to 
Khebenstein et al. [23] with minor modifications. Three microliters 
of the desulfo-GSL extract for seeds and 5 |il for leaves and flower 
buds were used in order to identify and quantify GSLs. 
Chromatographic analyses were carried out on an Ultra-High- 
Performance Liquid-Chromatograph (UHPLC Nexera LC-30AD; 
Shimadzu) equipped with a Nexera SIL-30AC injector and one 
SPD-M20A UV/VIS photodiode array detector. The UHPLC 
column was a C18 Atlantis T3 waters column (3 lim particle size, 
2.1 X 1 00 mm i.d.) protected with a C 1 8 guard cartridge. The oven 
temperature was set at 30°C. Compounds were dctci tc-d at 229 
nm and were separated by using the following method in aqueous 
acetonitrile, with a flow of 0.8 mL min~': 1.5 minutes at 
lOOyoHjO; a 1 1 min gradient from 0% to 25% (v/v) acetonitrile; 
1.5 min at 25% (v/v) acetonitrile; a minute gradient from 25% to 
0% (v/v) acetonitrile; and a final 3 min at 100% H2O. Data were 
recorded on a computer with the LabSolutions software 
(Shimadzu). Specific GSLs were identified by comparing retention 
times with standards and by UV absorption spectra. 

GSLs were quantified at 229 nm by using sinigrin (SIN, sinigrin 
monohydrate from Phytoplan, Diehm& Neuberger GmbH, 
Heidelberg, Germany) and glucobrassicin (GBS, glucobrassicin 
potassium salt monohydrate, from Phytoplan, Diehm& Neuberger 



GmbH, Heidelberg, Germany) as external standards and 
expressed in nmol g ' dry weight (DW). Calibration equations 
were made with, at least, five data points, from 0.34 to 1.7 nmol 

for sinigrin and from 0.28 to 1.4 nmol for glucobrassicin. The 
average regression equations for sinigrin and glucobrassicin were y 
= 148818 X (r2 = 0.99) and y = 263822 x(R2= 0.99), 
respectively. 

Statistical analysis 

A combined analysis of variance across organs and individual 
analyses of variance for each organ were made for individual and 
total GSL. Lines and organs were considered as fixed factors and 
replications were considered as random factors. Analysis of 
variance was performed with the PROC GLM of SAS [24]. 

The genetic map employed for the QTL analysis was created by 
Iniguez-Luy et al. [22] having 279 markers (SSRs and RFLPs) 
distributed along nine linkage groups (C 1-C9) with a total distance 
of 891.4 cM and a marker density of 3.2 cM/marker. Eight 
primer pairs described by Gao et al. [15] amplifying loci BoGSL- 
ELONG, BoGSL-ALK, BoGSL-PROa, BoGSL-PRO-b, BoCS- 
lyase, BoGS-OH, BoCYP79Fl and BoS-GT from B. okracea were 
screened in parent DH lines. Besides, SSRs Gil2 Hasan et al. [25] 
and O112-D05 [26] were screened in parental DH lines. SSRs 
Gil2 and O112-D05 map in both sides of ATRl gene of^. thaliana 
in chromosome 5 [25]. Amplifications were performed by 
following Gao et al. [15] and electrophoresis was carried out in 
1 % agarose gels and capillary electrophoresis system (CEQ_ 8000 
Beckman, Coulter). Polymorphic markers were then screened in 
the BolTBDH mapping population, scored and assigned to linkage 
groups withJoinMap 3.0 sofware [27]. The threshold for assigning 
markers to linkage groups was a LOD score between 5 and 8. 

Quantitative trait locus mapping was carried out thanks to a 
composite interval mapping method [28] by using the PLABQTL 
program [29]. In each organ (leaves, flower buds and seeds), 
analyses were carried out on each individual GSL and for each 
GSL type (aliphatic, indolic and aromatic) as well as on the total 
GSLs. A likelihood odds (LOD) threshold of 3.2 was chosen in 
order to declare a putative QTL significant by following the 
method described by Van Ooijen [30]. The confidence intervals 
were set at 95%. The analysis and cofactor election were carried 
out by following PLABQTL's recommendations, by using an 
T-to-enter' and an T-to-delete' value of 7. 

The proportion of phenotypic variance explained for a specific 
trait was determined by the adjusted coefficient of determination 
of regression (R^) fitting a model including all detected QTLs [31]. 
Fivefold cross-validation of QTLs was performed by following the 
procedures described by Utz et al. [32]. The frequency of QTL 
detection gives us an estimation of the precision of QTL 
localization. 

Significant QTLs for individual GSLs were integrated by using 
a QTL meta-analysis with BioMercator 2.1 software in order to 
give consensus QTLs [33]. An Akaike-type statistical criterion 
(AIC value) indicated the model which best fitted the data, 
including the number and the consensus QTLs positions. The aim 
of performing a meta-analysis was to find if a genomic region 
could determine the GSL content of different GSLs and if the 
same QTL was present in the three organs under study. 

Iniguez-Luy et al. [22] identified coUinear genomic blocks 
between the BolTBDH mapping population and A. thaliana by 
using a synteny analysis. This information was employed in order 
to identify candidate genes that may directly account for GSL 
QTLs in B. okracea. In following this approach, we tried to locate 
46 genes involved in GSL metabolism in A. thaliana which were 
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obtained from TAIR (The Arabidopsis Information Resource) on 
the BolTBDH map by in silico mapping. 

Epistatic interaction analysis among QTLs was performed by 
using the R/ qtl package of the R software [34] . 

Results 

Phenotypic variation in GSL content 

Twelve GSLs, belonging to three chemical classes, were 
detected in the BolTBDH population (Table 1). Eight GSL were 
aliphatic, three of them belonging to the 3C group: 3- 
methylthiopropyl (GIV), 3-methylsulfinylpropyl (GIB) and 2- 
propenyl (SIN); four belonging to the 4C group: 4-methylthiobutyl 
(GER), 4-methylsulfmylbutyl (GRA), 3-butenyl (GNA) and 

2- hydroxy-3-butenyl (PRO); and one belonging to the 5C group: 
5-methylsulfinylpentyl (ALY). Three indolic GSLs: 4-hydroxy- 

3- indolylmethyl (OHGBS), 3-indolylmethyl (GBS); and 1-me- 
thoxy-3-indolylmethyl (NeoGBS), and one aromatic GSL, 
2-phenylethyl (GNT), were also detected. 

Dilferent GSL profiles were detected in the parental lines. The 
following aliphatic GSLs were found in Pj (TO1000DH3) in 
different organs: GIV, GIB, SIN GER, GRA, GNA, and PRO. 
Aliphatic GER and GRA and PRO were detected in Pj ('Early 
Big' broccoli) meantime aliphatic ALY was found in the mapping 
population but it was not detected in its parents. Therefore, 30 
and 4C GSLs were found in Pi, while only 4C GSLs were found in 
Pa. Alkenyl GSLs (SIN, GNA and PRO) were found in Pi but not 
in P2 (only trace amounts of PRO in flower buds) (Table 1). 



The GSL profde of the mapping population varied depending 
on the organ. In leaves, 55.2% of GSLs were indolic and 40.2% of 
GSLs were aliphatic, being NeoGBS and GRA the major GSLs 
respectively. In seeds, 93.3% of total GSLs were aliphatic, and 
GRA, GNA and PRO were the major GSLs. The GSL profile of 
flower buds was intermediate among leaves and seeds as 67.7% of 
total GSLs were aliphatic and 28.6% were indolic. GRA, GNA 
and NeoGBS were the major GSLs in this organ. GIV and ALY 
were exclusively found in seeds, meanwhile GER was only found 
in flower buds and seeds (Table 1). 

Aliphatic GSL content in Pi was higher than that found in P2 in 
the three organs analyzed (Table 1). SIN and GNA were the major 
aliphatic GSLs found in the three organs for Pi In contrast GRA 
was the major GSL in P2 in the three organs. Regarding indolic 
GSLs, GBS and NeoGBS were found as the most abundant in 
both parents in both leaves and flower buds, while OHGBS was 
the major GSL found in seeds. Indolic GSL content was higher in 
P2 compared to Pi in both leaves and flower buds. Total GSL 
content in Pi was higher than that found in P2 leaves and seeds 
(Table 1). 

In the mapping population, the content of individual GSLs as 
well as the content of aliphatic, indolic and total GSLs showed 
continuous distributions. Extreme phenotypes were found for all 
traits, with the exception of GNT in leaves, compared to 
phenotypes observed in parent lines (Table 1). For example, 
extreme mean values of some individual GSL content in the 
mapping population are far beyond the content of any of the 
parents. For instance, GRA content in seeds was 0.72 |J,mol g~'dw 
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Figure 2. Framework map of DH population showing eighty-two metabolic quantitative trait loci (QTL) for individual GSLs and 
sums of GSLs. Linkage groups were labeled by following the nomenclature of Iniguez-Luy ef a\. [22], Bars represent the LOD confidence interval of 
each QTL. QTLs are in different colors depending on the plant organ: leaves (green), flower buds (red) and seeds (blue). After the name of each QTL, 
-P, indicates allele from DH rapid cycling of Chinese kale (TO1000DH3) and -P2 indicates allele from DH broccoli line 'Early Big'. 
doi:1 0.1 371/journal.pone.0091 428.g002 
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in Pi and 21.69 nmol g Mw in P2. The average GRA content in 
the mapping population was 22.62 |J,mol g 'dw and ranged from 
0.48 to 74.14 nmol g"'dw (Table 1). Total GSL content in the 
different organs varied nearly 18-fold within the mapping 
population. The average content of total GSLs was 4.01 |tmol 
g Mw in leaves, 10.13 |J,mol g 'dw in flower buds and 83.3 (imol 
g 'dw in seeds (Table 1). 

Analysis of variance 

Significant organ x line interactions were found for all traits, 
therefore individual analyses were carried out by organ. The 
source of variation due to lines was highly sigruficant for the most 
traits, except ALY and OHGBS in leaves and GIV and NeoGBS 
in seeds. The source of variation due to replications was in most 
cases non significant (data not shown). 

QTL analysis 

Three out of eight primer pairs designed by Gao et al. [15] were 
polymorphic in of the mapping population's parents. These 
markers could be mapped and located in three different linkage 
groups. BoGSL-OH mapped on C4 (28.8 cM), B()CYP79F1 
mapped on C5 (102 cM) and BoGSL-PROb mapped on C8 
(66 cM). SSRs OL12-D05 and Gil 2 were also polymorphic and 
they mapped on C8 (49 cM) and C9 (40 cM), respectively. QTL 
analyses were carried out with 279 markers designed by Iniguez - 
Luy and the five newly mappc-d primer pairs. No significant QTL 
was detected in any of the map positions where BoGSL-OH, 
BoCYP79Fl and BoGSL-PROb were located (Fig. 2). 

Eighty-two significant QTLs were detected being spread all over 
the 9 linkage groups of _B. olemcea. The number of QTLs by linkage 
group ranged between two in CI and 19 in C9 (Fig. 2). Twenty 
significant QTLs were found in leaves. The value of R^ ranged 
between 10.3% for GNA in G7 and 34.3% for the sum of aliphatic 
GSLs in C7 (Table SI). Half of QTLs had a frequency of cross- 
validation higher than 50%. Twenty-nine significant QTLs were 
detected in flower buds. R^ value ranged between 10.4% for the 
sum of aliphatic GSLs in C3 and 49.7'/'o for the sum of aliphatic 
GSLs in C9, respectively. Eighteen QTLs had a frequency of 
cross-validation higher than 50%. Thirty-three significant QTLs 
were found in seeds. R^ value varied between 10.3% for the sum of 
indolic GSLs in C6 and 49.4% for ALY in C5. Twenty-eight 
QTLs had a frequency of cross-validation higher than 50%. 

Consensus QTLs 

Based on the position of the QTLs and taking into account their 
confidence interval, a meta-analysis in order to render consensus 
QTLs for GSL concentration was carried out. Eighteen consensus 
QTLs were detected (Table 2). Fourteen consensus QTLs were 
present in seeds, 12 QTLs in leaves and 14 QTLs in flower buds. 
Seven QTLs were common to flower buds, leaves and seeds; three 
QTLs were exclusively found in leaves, two QTLs were exclusively 
found in flower buds and other two QTLs were exclusive found in 
seeds. In order to make the discussion clearer, results regarding 
consensus QTLs are going to be presented according to each 
chemical GSL class. 

Aliphatic GSLs 

Located in C3, consensus QTL-3.1 controls the content of PRO 
and GNA in the three organs (Table 2). Alleles for increasing PRO 
content are given by Pi, while alleles for increasing GNA content 
are given by P2 (Fig. 2). Consensus QTL-5.1, located in C5, 
controls the content of GIB and SIN in the three organs. Alleles 
for increasing the content of both GSLs are given by Pi. In C9, 



consensus QTL-9.2, which controls the content of PRO, GNA, 
GRA, GER (4C-GSL) and SIN, and GIB (3C-GSL) in the three 
organs, was located. Alleles for synthesis of PRO, SIN and GNA 

are given by P 1 , while alleles for increasing the content of GRA, 
GER and GIB are given by Pg (Fig. 2). Other QTLs which control 
aliphatic GSL content exclusively are QTL- 1.1, QTL-2.2, QTL- 
3.1, QTL-3.2, QTL-3.4, QTL-4.2 and QTL-7.2. 

Indolic and aromatic GSLs 

Several consensus QTLs only controlled the indolic GSL 
content. QTL-1.2, QTL-3.3, QTL-4.1 and QTL7.4 determined 
the GBS content in seeds and flower buds (Table 2). Alleles for 
increasing the content of GBS are given by P2 in all these QTLs 
except for QTL-3.3, where alleles came from both parents. 
Consensus QTL-2.1 determines the content of OHGBS and GBS 
in seeds and flower buds. The allele for increasing OHGBS is 
given by P2 in flower buds, while the allele for increasing GBS 
content is given by Pi. Consensus QTL-8.1 determines the 
OHGBS, NeoGBS and total indohc GSL content in the three 
organs. Besides, this QTL also controls the content of the aromatic 
GNT. Other QTLs for GNT content are QTL5.2 and QTL7.1. 
The genomic regions QTL-1.2, QTL-2.2 and QTL-7.4 are 
coUinear with genomic regions of A. thalima in chromosomes 4, 5 
and 2. In these regions, genes CYP83B1, CYP81F2 and CYP79B3 
from A. thaliana were found by means of in silico mapping. 

Eplstatlc networks 

A total of 85 significant epistatic interactions were found when 
taking into account the three organs and all the traits. Thirteen 
epistatic interactions were found in leaves, 52 in flower buds and 
1 3 in seeds. Some of these interactions are common to the three 
organs under study. Sixty-eight interactions were detected in 
aliphatic GSLs, 13 in indolic GSLs and 4 in total GSLs. An 
average of 3.5 significant epistatic interactions was found per trait 
(Fig. SI). 

Forty-two interactions were detected between QTLs, being two 
of them negative. Twenty interactions were detected between 
QTL9.2 (proposed as GSL-ALK in this work) and other QTLs in 
traits related to aliphatic GSLs (Fig. 3). The relationship between 
QTL9.2 and QTL 3.1 (proposed as GSL-OH) was found for the 
aliphatic GNA, PRO, GER and GIB in the three organs under 
study. The relationship between QTL9.2 and QTL5.1 (proposed 
as GSL-PRO) was found for die afiphatic GER, SIN and GNA in 
the three organs (Fig. 3). In the network controlled by GSL-ALK, 
interactions between aliphatic and indolic QTLs were observed. 
For example, QTLs 3.3, 4.1 and 9.1 control the GBS content and 
the three of them interact with QTL 9.2 in order to produce 
aliphatic GSLs (Fig. 3). 

Discussion 

Phenotypic variation in GSL content 

Parents of the mapping population had different GSL profiles 
and concentration. Particularly, parent Pg has a higher concen- 
tration of GRA and a lower concentration of GNA than parent Pi 
in the three organs. GRA is found in several B. oleracea crops like 
cauliflowc-r. cabbage and kale, although high levels of GRA 
erjui\ alcnt to those found in P2 ('Early Big' broccoli) are always 
found in broccoli [35-38]. The effect of sulforaphane, the 
isothiocyanate derived from GRA, against cancer has been 
reviewed in detail [8,39]. As a result of these epidemiological 
and biomedical studies, GRA is now viewed as a quaUty trait in B. 
oleracea crops to be targeted in breeding programs. 
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Figure 3. An epistatic network including all the significant relationships of QTL9.2 (GSL-ALK) with other QTLs. Aliphatic glucosinolates: 
GIV, Glucoiberverin; GIB, Glucoiberin; SIN, Sinigrin; GER, Glucoerucin; GRA, Glucoraphanin; GNA, Gluconapin; PRO, Progoitrin; ALY, Glucoalyssin; GBN, 
Glucobrassicanapin; ALIPH: sum of aliphatic GSLs; Indolic glucosinolate: GBS, Glucobrassicin; TOTAL: sum of total GSLs. Organs: L, Leaves; F: Flower 
buds; S: seeds. Continuous lines represent positive epistatic interactions while dashed lines represent negative epistatic interactions. 
doi:1 0.1 371 /journal.pone.0091 428.g003 



Distributions of individual and sums of GSLs were in most cases 
transgressive. These types of segregations liave been described 
before for GSL content in Brassica [40,41] and could be due to new 
combinations of additive alleles or to epistatic interactions among 
loci for GSLs, which have already been described [42,43]. 

Total GSL content varied considerably depending on the organ 
under study. As it was expected, seeds accumulated the highest 
GSL content followed by flower buds and leaves. After studying 
the GSL content in different organs of ^. thaliana. Brown et al. [17] 
found that seeds had the highest concentration followed by 
inflorescences, sUiques, leaves and roots. Velasco et al. [20] found 
that the GSL content in flower buds was higher than kale leaves 
[20]. These results may reflect the need to indicate de novo synthesis 
of GSLs and/ or mobilization [1 7] . 

The GSL profile also varied considerably depending on the 
organ. In fact, seeds were mostly composed of aliphatic GSLs, 
whereas indohc GSL were predominant in leaves. Flower buds 
had an intermediate profile. Besides, flower buds and seeds 
showed more diversification of aliphatic GSLs, since GIV and 
ALY were only found in seeds and GER was only found in flower 
buds and seeds. Agreeing with these results, kale leaves are 
characterized by high amounts of indolic GSLs during the fu'st 
plant stages, while aliphatic GSLs are predominant in flower buds 
and in leaves taken at the end of the vegetative stage [20]. A 



similar pattern was observed in A. thaliana, where seeds are 
distinguished by unique aliphatic constituents and low level of 
indolic compounds. After germination, the proportion of aliphatic 
GSLs declined with age, thus resulting in the predominance of 
indolic GSLs by the time of senescence [17]. 

QTLs analysis 

Seven out of 20 consensus QTLs determined the content 
exclusively in one of the three organs under study. Our results 
suggest that the regulation of genes underlying several QTLs is 
organ-dependent. Feng et al. [43] analysed QTLs for GSL content 
in leaves and seeds of B. napus and found 17 QTLs which were 
exclusively detected in leaves. Kliebenstein [23] found three 
organ-specific QTLs for aliphatic GSLs in both leaves and seeds of 
A. thaliana. A similar number was found for indolic GSLs. 

Aliphatic GSLs 

Several major loci determine the profile and content of aliphatic 
GSLs in Brassica [44]. The GSL-ELONG and GSL-PRO loci 
regulate the side chain length (Fig. IB). The presence of 4C-GSL is 
controUed by a dominant aUele of GSL-ELONG (GSL- 
ELONG+), whereas the presence of 3C-GSL is controlled by a 
dominant allele of GSL-PRO (GSL-PRO+) [45]. GSL-ALK 
controls side chain desaturation. The presence of GSL-ALK+ in 
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3C-GSL determines the production of alkenyl GSL. GSL-OHP 
catalyzes production of 2-hydroxypropyl GSL, but this GSL was 
not detected in parents or the mapping population. GSL-OH 
controls PRO production and its action is conditioned by the 
presence of GSL-ALK+ [45]. After analyzing parents of the 
mapping populations, it can be concluded that the genotype of Pi 
is GSL-ELONG+, GSL-PRO+, GSL-ALK+ and GSL-OH+, 
while the genotype of Pj is GSL-ELONG+, GSL-PRO-, GSL- 
ALK-. Because P^ is GSL-ALK- and the presence of GSL-ALK+ 
is needed in order to produce hydroxylated GSL, the genotype for 
the locus GSL-OH could not be determined. GSL-ELONG 
cannot be located into the mapping population, because both 
parents had the same genotype for this locus. Primer pairs 
amplifying loci GSL-PROb and GSL-OH designed by Gao et al. 
[15] were located in the mapping population in different positions 
as those reported by the authors, thus probably indicating an 
unspecific amplification of PGR products. 

Consensus QTL-5.1 controls the amount of three 3C-GSLs: 
GIB, GIV and SIN. Alleles for increasing 3C-GSLs content are 
given by Pi. Thus, GSL-PRO would be a good candidate gene for 
this QTL. This major locus was cloned [14] and mapped at the 
top of C5 in B. oleracea [15]. Position of C5 markers in the map of 
Iniguez-Luy et al. [22] is inverted with regard to C 5 in the map of 
Gao et al. [15]. Taking this into account, the position of QTL-5.1 
coincides with that of GSL-PRO. This information together 
supports the validation of the candidate gene. This QTL also 
controls the content of two indolic GSLs GBS and NeoGBS. 
Aliphatic and indolic GSLs are synthesized and subserjuently 
modified by two independent parallel pathways [46]. However, 
there are cross-talks between both pathways. Wentzell et al. [46] 
found that GSHMD0LIC.IV.8 and GSL.IMDOUC.V.20 QTLs, 
which control the content of several indolic GSLs in A. thaliana, 
map in the same genomic locations as GSL-AOP and GSL- 
ELONG loci which control aliphatic GSLs [46]. 

Consensus QTL-9.2 controls the amount of several GSLs. 
Alleles for increasing alkenyl GSL content (SIN, PRO, GNA) are 
given by Pi, while alleles for increasing non alkenyl GSL content 
(GRA, GER, GIB) are given by Pg (Fig. IB). Locus GSL-ALK was 
studied and cloned by Li and Quiros [13] and mapped in C9 [15] 
in the same position as QTL-9.2. Consensus (,)TL-3.1 controls the 
amount of GNA and its hydroxylated form PRO (Fig. IB). 
Curiously, alleles for increasing GNA content are given by 
Piwhich is GSL-OH+, while alleles for increasing PRO content 
are given by P2. This makes us think that Pg is also GSL-OH-)-. 
The function of this QTL would correspond to gene GSL-OH. 
Gao et al. [15] mapped this gene in C9, close to GSL-ALK. The 
position of the gene does not correspond to QTL-3.1. After 
searching in the whole genome sequence of A rapa, Zang et al. [47] 
and Wang et al. [48] found GSL genes homologous to those of A. 
thaliana. Three different copies of gene GSL-OH were found in B. 
rapa due to the triplicate nature of its genome [48] . Several copies 
of the same genes could also exist in B. oleracea. 

During the first stage of the development of the core structure of 
aliphatic GSL (Fig. I), the gene CYP79F1 metabolizes mono- to 
hexahomomethionine into their corresponding aldoxime in A. 
thaliana [49] . Primers designed in order to amplify this gene in B. 
oleracea [15] were employed in this work. CYP79F1 mapped in C5, 
in the same position found by Gao et al. [15], but no QTL was 
found in this position, thus indicating that both parents have the 
same allele for this gene. Consensus QTL-2.1 controls the content 
of total aliphatic GSLs in leaves and flower buds and the total GSL 
content in flower buds, but it does not control the content of any 
individual GSL, thus suggesting that the gene underlying this QTL 
may have a regulatory role in the aliphatic GSL pathway. Two 



R2R3-Myb transcription factors (Myb 28 and Myb 29) positively 
control biosynthesis of aliphatic GSLs in A. thaliana [50] and could 
be candidate genes for this consensus QTL. 

Indolic and aromatic GSLs 

In the first stage of the development of the core structure (Fig. 
lA) of indolic GSLs, two cytochromes P450 (CYP79B2 and 
CYP79B3) catalyze the conversion of Tip to indole-3-acetaldox- 
ime in A. thaliana [51,52]. Overexpression of CYP79B2 results in 
an increased accumulation of indole GSLs, specifically 3- 
indolylmethyl (GBS) and 4-mctli()xy-gluc<)l)rassicin (MeOH- 
GBS) (not detected in this work). In the next step, CYP83B1 
catalyzes the transformation of indole-3-acetaldoxime into to 5' - 
alkyl-thiohydroximate (Fig. lA) [53,54]. The Myb transcription 
factor ATRl from A. thaliana regulates the expression of genes 
CYP79B2, CYP79B3, and CYP83B1. Overexpression of ATRl 
leads to lines with higher levels of total indolic GSLs than wild- 
type plants [55]. CYP81F2 catalyzes the hydroxylation at position 
4 of the indole ring of GBS, which results in the formation of 
OHGBS and MeOH-GBS [56]. 

After in silico mapping of ^. thaliana GSL genes, CYP79B2 and 
CYP79B3 were located inside the confidence interval of consensus 
QTL- 1.2 and QTL-7.4. Both of them determine variation for 
GBS in seeds, agreeing with a possible high expression of 
candidate genes CYP79B2 and CYP79B3. 

SSRs Gil2 and O112-D05 map in both sides of ATRl gene of 
A. thaliana in chromosome 5 [25]. Gil 2 mapped in C9 in our work, 
where no QTL was dcttx ted. 0112-D()5 mapped within the 
consensus QTL-8.1 confidence interval. This QTL determines 
variation for OHGBS, NeoGBS and total indolic GSL content in 
the three organs analyzed. 

The high apparition of QTLs for indolic GSL content agrees 
with a high expression of ATRl candidate gene. Besides, aromatic 
GNT is also controlled by this QTL. Aromatic GSLs are also a 
substrate of CYP83B1, regulated by ATRl. These results together 
suggest that ATRl could be a possible candidate gene for QTL- 
8.1. 

Consensus QTL-2.1 determines variation for NeoGBS and 
GBS in flower buds and seeds. Candidate gene CYP81F2, 
metabolizing the step from GBS to NeoGBS from A. thaliana, 
was found in the confidence interval of this QTL. 

The B. oleracea whole genome sequencing is currently carried out 
by using TO1000DH3 as the reference genome. Sequences are 
being aligned by using mapping population BolTBDH. B. oleracea 
sequencing project will be a great opportunity to link sequences 
with the QTLs described in this work. 

Epistatic networks 

Significant epistatic interactions were found for the three organs 
under study. On the contrary of what was found by Feng et al. [43] 
in B. napus, part of the interactions were common among organs. 
The number of interactions was higher in flower buds, thus 
indicating a more complex regulation of GSL biosynthesis in this 
organ. Epistatic interactions for indolic GSLs were less complex 
than for aliphatic GSLs. 49% of the epistatic interactions detected 
were between QTLs, thus indicating that variability for GSLs 
content is determined directiy by QTLs and indirectly by 
interacting with other loci. 

Epistatic interactions among GSL-ALK, GSL-PRO and GSL- 
OH, determine variability for aliphatic GSL content and have 
been described before (reviewed by Kliebenstein [44]) in A. 
thaliana. They are mediated by transcriptional factors. In this work 
we have found that GSL-ALK plays a central role in the network 
of epistatic interactions for aliphatic GSLs, suggesting a possible 
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regulatory effect of this locus. Indirectly, GSL-ALK also controls 
the variability for the indolic GSL named GBS, thus indicating 
cross-talk between indolic and aliphatic pathways. This informa- 
tion supports the results found by Wentzell et al. [46] in A. thaliana. 
These authors transformed a null accession for AOP2 and AOP3 
genes (GSL-ALK locus) with AOP2 gene from B. okracea, thus 
resulting in the production of alkenyl GSLs, doubling of total 
aliphatic GSL content and the induction of aliphatic GSL 
biosynthetic genes and regulatory genes. 

Conclusions 

An extensive analysis of QTLs controlling GSL variation in 
three different organs of B. okracea has been presented. Possible 
candidate genes for different QTLs have been proposed based on 
the phenotypic study of the progeny and on the synteny with A. 
thaliana. Epistatic interactions among QTLs have been detected 
showing a central role of GSL-ALK in determining aliphatic GSL 
variation and suggesting a regulatory effect of this locus. Further 
work is going to be carried out in order to validate them and to 
find new candidate genes for remaining QTLs. 
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